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Abstract  -  We  consider  in  this  paper  local  sensor 
quantizer  design  for  large-scale  bandwidth  and/or 
energy  constrained  wireless  sensor  networks  (WSNs) 
operating  in  fading  channels.  In  particular,  under 
the  Neyman- Pear  son  framework,  we  address  the  de¬ 
sign  of  binary  local  sensor  quantizers  for  a  binary 
hypothesis  problem  in  the  asymptotic  regime  where 
the  number  of  sensors  is  large.  Motivated  by  the  sen¬ 
sor  censoring  idea  for  reduced  communication  rate, 
each  sensor  either  transmits  ‘1  ’  to  a  fusion  center  or 
remains  silent.  By  adopting  energy  detector  as  the 
fusion  rule,  we  develop  a  procedure  to  obtain  local 
sensor  threshold  that  maximizes  the  Kullback-Leibler 
distance  of  the  distributions  of  the  fusion  statistic 
under  the  two  hypotheses.  The  proposed  quantizer 
design  is  well  suited  for  the  emerging  large  scale 
resource- constrained  WSNs  applications.  Numerical 
results  based  on  Gaussian  and  exponential  observa¬ 
tions  are  presented  to  demonstrate  the  design  proce¬ 
dure. 

Keywords:  Wireless  sensor  networks,  distributed  detec¬ 
tion,  asymptotic  regime,  censoring  sensors. 

1  Introduction 

The  design  of  local  quantizer  for  distributed  detection 
under  communication  rate  constraint  has  been  studied 
over  the  past  decades  (see,  e.g.,  [1  7]).  For  example, 
the  sensor  censoring  idea,  was  proposed  by  Rago  et 
al  in  1996  [1]  in  the  context  of  decentralized  detec¬ 
tion  for  reduced  communication  rate.  With  censored 
sensors,  only  the  sensor  with  informative  observation, 
measured  by  its  local  likelihood  ratio  (LR)  value,  sends 
its  LR  value  to  the  fusion  center.  Under  Neyman- 
Pearson  framework,  an  extreme  censoring  scheme  with 
an  on/off  local  sensor  signaling  structure  has  been  con¬ 
sidered  in  [3]  in  the  context  of  studying  locally  opti¬ 
mum  distributed  detection,  where  if  the  local  LR  ex¬ 
ceeds  certain  threshold,  then  a  single  bit  of  informa¬ 
tion  is  sent;  otherwise,  the  sensor  keeps  silent.  More 
recently,  under  the  Bayesian  framework,  the  design  of 
optimal  local  thresholds  for  distributed  detection  un¬ 
der  such  on/off  signaling  structure  was  studied  in  [7]. 

In  this  paper,  we  consider  the  Neyman-Pearson 
framework  and  address  the  optimal  binary  local  sen¬ 


sor  quantizer  design  for  a  binary  hypothesis  testing 
problem.  In  particular,  we  study  the  asymptotic  case 
where  the  number  of  sensors  is  large.  Using  an  energy 
detector  as  the  fusion  statistic  at  the  fusion  center, 
we  propose  a  procedure  to  determine  the  optimal  lo¬ 
cal  threshold  through  maximizing  the  Kullback-Leibler 
(KL)  distance  of  the  distributions  of  the  fusion  statis¬ 
tic  under  the  two  hypotheses.  Specifically,  we  develop 
efficient  algorithms  to  facilitate  the  determination  of 
the  optimal  thresholds  for  various  scenarios,  catering 
to  different  constraints. 

The  organization  of  the  paper  is  as  follows.  In  the 
next  section,  we  introduce  the  system  model  and  for¬ 
mulate  the  problem.  In  Section  3,  we  pose  the  design 
problem  as  maximizing  the  KL  distance  while  subject 
to  rate  constraints  at  local  sensors  and  present  algo¬ 
rithms  to  obtain  the  optimal  thresholds  numerically. 
Design  examples  under  Gaussian  and  exponential  dis¬ 
tributions  are  provided  in  Section  4  to  illustrate  the 
design  procedure.  We  finally  conclude  in  Section  5. 

2  Problem  formulation 

Fig.  1  depicts  a  canonical  parallel  fusion  structure 
in  the  presence  of  non-ideal  channels.  All  the  chan¬ 
nels  are  assumed  independent  Rayleigh  fading  channels 
and  corrupted  by  independent  identically  distributed 
(i.i.cl.)  complex  white  Gaussian  noises.  Specifically, 
we  use  hke^k  and  nk  to  denote  the  channel  coeffi¬ 
cient  and  channel  noise  of  the  kth  channel.  The  quan¬ 
tity  hk  follows  a  Rayleigh  distribution  with  pdf  of 
P(hk )  =  2 hke~hk,  (j)k  €  U[0,  27t]  is  the  phase  due  to  the 
transmission,  and  ri/-  is  a  zero  mean  complex  Gaussian 
noise  whose  real  and  imaginary  parts  are  independent 
of  each  other  and  have  equal  variance  a2. 

Assume  the  local  observations  X k  =  1,  2,  •  •  • ,  K, 
are  independent  across  sensors  conditioned  on  the  hy¬ 
potheses,  i.e. ,  for  *  =  0,1, 

K 

P(Xi, . .  .,XK\  Hi)  =  n  P(Xk\Hi), 

fc= l 

where  K  is  the  total  number  of  sensors  and  H0  and  H i 
represent  the  two  hypotheses.  We  further  assume  that 
each  local  sensor  makes  a  binary  decision  Uk  based  on 
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Figure  1:  Parallel  fusion  model  in  the  presence  of 
Rayleigh  fading  and  noisy  channels  between  the  local 
sensors  and  the  fusion  center 


its  observation  X k: 


The  goal  in  this  paper  is  to  obtain  the  optimal  lo¬ 
cal  decision  rules  (i.e. ,  find  the  optimal  local  sensor 
quantization  thresholds)  that  achieves  optimal  detec¬ 
tion  performance  under  the  communication  constraint. 
In  particular,  under  the  Neyman-Pearson  framework, 
we  consider  the  asymptotic  case  where  the  number  of 
sensors  is  large.  The  design  rationale  and  our  main 
results  are  provided  in  the  next  section. 


3  Asymptotic  Regime 

Throughout  this  paper,  we  consider  K ,  the  number  of 
local  sensors,  to  be  large. 

We  assume  all  sensors  adopt  likelihood  ratio  (LR) 
test  as  the  local  decision  rule  with  the  same  threshold, 
denoted  by  r.  This  is  motivated  by  the  classical  result 
[8]  where  it  was  shown  that  identical  decision  rule  is 
asymptotically  optimum  for  binary  hypothesis  testing 
with  identically  distributed  observations.  Thus,  local 
sensors  have  identical  local  performance  indexes,  which 
we  denote  by  pf  =  Pfk  and  pd  =  Pdk-  Moreover,  the 
rate  constraints  for  all  sensors  are  the  same,  i.e.,  R  = 
Pf  <  P- 

Given  the  system  model  depicted  in  Fig.  1,  the  re¬ 
ceived  signals  at  the  fusion  sensor  can  be  expressed  as, 
for  k  =  1,2,---, K, 


f  nk  if  uk  =  0 

\  hke 3<l>k  +  nk  if  uk  =  1 


(1) 


uk  =  7 k{xk)  e  {1,0} 

where  7 *,(.)  stands  for  the  local  decision  rule  at  the  kth 
sensor.  If  uk  =  1,  sensor  k  will  send  uk  to  the  fusion 
center  through  a  fading  channel;  Otherwise,  the  sensor 
remains  silent,  i.e.,  no  transmission  is  needed. 

Denote  p  as  the  communication  constraint  on  each 
sensor.  In  this  work,  we  assume  all  local  sensors 
have  the  same  communication  constraint.  Under  the 
Neyman-Person  framework,  following  the  similar  spirit 
as  in  [1],  we  define  the  communication  rate  Rk  for  the 
kth  sensor  as  the  probability  of  false  alarm  rate  at  the 
sensor.  Let  pdk  and  pfk  denote  the  detection  probabil¬ 
ity  and  the  false  alarm  rate  at  the  kth  local  sensor,  the 
communication  constraint  requires  the  following  con¬ 
dition  to  be  satisfied: 

Rk  Pfk  —  P 

for  k  =  1  We  note  that  this  rate  constraint 

is  different  from  the  constraint  on  the  false  alarm  rate 
under  the  Neyman-Pearson  test:  the  size  of  the  test  for 
a  distributed  detection  system  is  defined  for  the  fusion 
center  whereas  here  we  constrain  the  size  of  the  local 
sensors. 

At  the  fusion  center,  based  on  the  received  channel 
outputs  2/1,  •  •  • ,  Uk  ,  a  final  decision  uq  on  which  hy¬ 
pothesis  is  true  is  made  by  implementing  an  optimal 
fusion  rule  70.  That  is, 

uo  =  70 (j/i,  •  •  •  ,Vk) 


In  the  current  setup,  we  assume  that  only  chan¬ 
nel  fading  statistics  are  available  at  the  design  stage. 
Therefore,  for  incoherent  detection,  we  adopt  an  en¬ 
ergy  detector  as  the  fusion  statistic,  i.e., 

A  =  ^Ei^-i2  = 

fc= 1  fc= 1 

where  zk  =  \yk\2  and  is  independent  of  one  another  for 
k  =  K.  given  the  independence  assumptions  on 

the  channels  and  on  channel  noises.  It  is  shown  in  [5,6] 
that  the  energy  detector  is  a  good  approximate  of  the 
optimal  fusion  rule  at  low  signal-to-noise  ratio  regime. 
The  conditional  pdfs  of  zk  given  the  local  decision  uk 
are  given  by: 

1  _  zk 

P(zk\uk  =  0)  =  ^2 

P(zk\uk  =  l)  =  1  e~ 

That  is,  given  the  local  decisions  uk,  zk  s  are  i.i.d.  and 
are  exponentially  distributed  with  respective  mean  val¬ 
ues  equal  to  2cr2  and  1  +  2cr2. 

Consider  the  number  of  sensors  to  be  large,  the  Cen¬ 
tral  Limit  Theorem  allows  us  to  approximate  the  dis¬ 
tributions  of  A  as  Gaussian  distributions  under  both 
hypotheses.  The  following  lemma  describes  the  distri¬ 
butions  of  A  specifically. 

Lemma  1  In  the  asymptotic  regime,  the  conditional 
pdf  of  A  given  hypothesis  Hj  is  N(pj,crj)  for  j  =  0, 1. 


Specifically, 


Si  =  E[A\Hi]  =  Pd  +  2a2 
So  =  [A  | -Ho]  =  Pf  +  2c2 
0i  =  Var  [A|-Hi]  =  f{pd) 
al  =  Var[A\H0\  =  f(pf) 

where 

f(x)  =  ^-{(2cj2]2  +  2(1  +  2a2)x  -  x2]  (2) 

Since  pd  and  pf  are  functions  of  the  local  LR  thresh¬ 
old  r,  both  means  and  variances  of  A  under  both  hy¬ 
potheses  are  also  functions  of  r. 

The  KL  distance  (relative  entropy),  defined  as 
D(P0\\Pi)  =  Epo( log-pi1),  between  the  two  distribu¬ 
tions  under  test  is  directly  related  to  the  detection 
performance  in  an  asymptotic  regime  given  Pj  = 
P(A\Hj),  j  =  0, 1.  Stein’s  lemma  [9]  states  that  under 
the  Neyman-Pearson  framework  the  best  achievable  er¬ 
ror  exponent  in  the  probability  of  error  is  given  by  the 
KL  distance. 

In  our  setup,  for  Gaussian  distributions  Po  and  Pi, 
we  can  further  simplify  D(P0\\Pi)  as 

=  (3) 
0o  *0i 

Throughout  this  paper,  we  use  the  natural  loga¬ 
rithm  in  the  KL  distance  expression,  i.e. ,  the  measure 
unit  of  D(P0\\Pi)  is  nats.  It  is  shown  in  [9]  that  the 
KL  distance  is  nonnegative.  Clearly,  the  KL  distance 
D (Po| | Pi)  is  a  function  of  local  LR  threshold  r.  As 
such,  the  optimum  local  threshold  r  can  be  determined 
by 

t*  =  arg  maxi)  (P0|  |  Pi)  =  argmaxP(r)  (4) 

r  r 

To  accommodate  the  communication  rate  constraints, 
we  now  pose  the  design  problem  as  the  following  con¬ 
strained  maximization  problem: 

max  D(t) 

subject  to  R  =  pf  <  p 

This  is  a  nonlinear  optimization  problem  with  in¬ 
equality  constraint  [10].  In  general,  since  D(t)  is  not 
a  convex  function,  we  can  not  adopt  the  Kuhn- Tucker 
Theorem  [10]  directly  to  solve  the  above  optimization 
problem. 

The  closed-form  solution  to  the  local  sensor  thresh¬ 
old  may  not  be  obtained  directly.  In  the  current  work, 
however,  since  the  rate  function  R(t)  =  Pf(r)  is  a  de¬ 
creasing  function  as  r,  we  can  devise  an  efficient  algo¬ 
rithm  to  obtain  the  optimal  r  numerically,  as  described 
below: 

Algorithm 

1.  Obtain  r*  based  on  Eq.  (4)  where  r*  corresponds 
to  the  maximum  point  of  the  KL  distance  without 
any  rate  constraint. 

2.  Check  if  r*  satisfies  R(t*)  <  p; 


(a)  If  yes,  stop,  and  the  optimum  solution  ropt  = 
r*; 

(b)  If  not,  go  to  3). 

3.  Calculate  by  solving  R(t)  —  p  =  0.  Since 
R(t)  monotonically  decreases  as  r  increases,  the 
optimum  point  must  be  in  the  region  of  [t^p\  oo). 
Calculate  the  KL  distance  of  the  following  three 
types  of  points  within  this  region: 

(a)  the  start  point:  r ^  =  r^; 

(b)  the  end  point:  r ^  =  oo; 

(c)  the  points  i  =  1, 2,  •  •  • ,  N0,  which  sat¬ 
isfy  =  0.  The  quantity  N0  denotes  the 

total  number  of  such  points. 

4.  Compare  the  KL  distances  associated  with  the 
points  in  3).  The  optimum  solution  ropt  is  the 
threshold  which  corresponds  to  the  largest  KL  dis¬ 
tance  among  those  obtained  in  3). 

In  many  cases,  for  example,  for  local  observations 
with  Gaussian  and  exponential  distributions,  the  LR 
threshold  r  can  be  translated  directly  to  the  local  ob¬ 
servation  threshold,  denoted  as  p.  The  rate  function 
R(p)  is  thus  either  a  monotonically  decreasing  or  in¬ 
creasing  function  of  p.  In  the  former  case,  the  above 
algorithm  is  directly  applicable  with  r  replaced  by  p. 
To  deal  with  the  latter  case,  i.e.,  the  increasing  rate 
function  R(p ),  we  only  need  slightly  modify  the  algo¬ 
rithm  in  step  3).  Specifically, 

•  set  the  start  point  i/s)  =  ps,  where  7~]s  is  the 
minimum  value  of  p  associated  with  the  detection 
problem  under  consideration,  e.g.,  in  the  Gaussian 
observations  case,  ps  =  —  oo. 

•  set  the  end  point  p^  =  p^p\ 

In  the  next  section,  we  will  present  two  design  ex¬ 
amples  to  find  the  optimal  local  thresholds  using  the 
proposed  algorithms. 

4  Design  Examples 

In  this  section,  we  demonstrate  the  design  procedure 
described  in  the  previous  section  through  examples  of 
Gaussian  and  exponential  distributed  local  observa¬ 
tions. 

4.1  Gaussian  Observations  Case 

In  the  case  of  the  detection  of  a  known  signal  in  in¬ 
dependent  Gaussian  noises,  the  observations  at  local 
sensors  follow  Gaussian  distributions.  Specifically,  we 
assume  that 

Hq  ■  Xk  =  Vk 

H±  :  Xk  =  s  +  vk  (6) 

where  s  is  the  known  signal,  Vk,  k  =  1,2, 
are  i.i.d.  white  Gaussian  noises  with  zero  mean  and 


variance  cr„.  Without  loss  of  generality,  we  assume 
s  —  1  in  the  simulation. 

Notice  that  for  the  Gaussian  problem,  the  LR 
threshold  r  can  be  directly  translated  to  the  thresh¬ 
olds  for  the  local  observations  ij.  Then  the  false  alarm 
rate  pf  and  the  detection  probability  pd  at  local  sensors 
can  be  expressed  as: 

p>  =  Q(i)  (7) 

Pd  =  Q  {^r^j  (8) 

Thus,  we  further  obtain 

Hr  =  Q  ^  +  2cr2 

Mo  =  Q  +  2cr2 

*  “  '  («V>) 

=  /(<>) 

where  f(x)  is  defined  in  Ecp  (2)  in  the  previous  section. 

Given  the  form  of  pf  in  Eq.  (8),  we  can  show  that 
the  rate  function  R  =  pf  is  a  monotonically  decreasing 
function  of  p.  Hence,  the  optimal  threshold  p  can  be 
obtained  using  the  procedure  described  in  the  proposed 
algorithm.  The  simulation  results  under  different  rate 
constraint  p  are  listed  in  Table  1  at  the  channel  signal- 
to-noise  ratio  (SNR)  =  OdB. 

As  seen  from  Table  1,  the  results  obtained  us¬ 
ing  the  proposed  algorithm  and  through  exhaustive 
search  match  very  well  for  different  communication 
constraint  p.  Another  interesting  observation  is:  as 
p  increases,  the  obtained  threshold  will  eventually  re¬ 
main  unchanged.  This  is  not  surprising:  in  such  case, 
the  optimal  solution  coincides  with  that  of  the  uncon¬ 
strained  optimization. 

To  better  understand  this  behavior,  we  provide  a 
plot  of  the  KL  distance  versus  the  rate  constraint  p  in 
Fig.  2.  Three  different  curves  represent  different  chan¬ 
nel  SNRs.  The  tradeoff  between  the  communication 
rate,  the  channel  SNR  and  the  system  performance  can 
be  clearly  seen  from  the  figure.  For  small  p,  increasing 
p  improves  the  KL  distance  yet  as  p  becomes  large, 
D (Po| | Pi)  eventually  levels  off  in  all  cases.  Moreover, 
the  higher  the  channel  SNR,  the  better  the  local  quan¬ 
tization  can  do  to  improve  the  system  performance. 

4.2  Exponential  Observations  Case 

Next  we  consider  an  example  of  exponential  observa¬ 
tions  at  local  sensors.  The  conditional  distributions 
under  exponential  observations  assumption  can  be  ex¬ 
pressed  as 

f(Xk\H0)  =  poe-0oXk  (9) 

f(Xk  |ffr)  =  pie~^Xk  (10) 

where  X k  >  0,  and  without  loss  of  generality,  we  as¬ 
sume  that  0  <  Po  <  Pi- 


p 

p  (algorithm) 

p  (exhaustive  search) 

.1 

1.2816 

1.2816 

.2 

0.8596 

0.8596 

.4 

0.8596 

0.8596 

.6 

0.8596 

0.8596 

1 

0.8596 

0.8596 

Table  1:  Local  observation  thresholds  obtained  for 
Gaussian  observations. 


P 

(algorithm) 

p  (exhaustive  search) 

.1 

0.0702 

0.0702 

.2 

0.1488 

0.1488 

.4 

0.3406 

0.3406 

.6 

0.3582 

0.3584 

1 

0.3582 

0.3584 

Table  2:  Local  observation  thresholds  obtained  for  ex¬ 
ponential  observations. 

Similarly,  the  LR  threshold  r  can  be  translated  di¬ 
rectly  to  the  local  observation  threshold  in  the  expo¬ 
nential  observations  case.  The  local  decision  rule  is  in 
the  form  of  the  following: 

Uk  =  0 

Xk  %  V  (11) 

uk  =  1 

Consequently,  the  false  alarm  rate  pf  and  the  detection 
probability  pd  at  local  sensors  can  be  expresses  as 

pf  =  l~e~0ov  (12) 

Pd  =  1  —  e~0lV  (13) 

It  can  be  shown  that  the  rate  R  =  pf  is  a  monoton¬ 
ically  increasing  function  of  p.  Hence,  we  can  adopt 
the  modified  algorithm  to  obtain  the  optimal  thresh¬ 
old  p  numerically.  The  simulation  results  for  different 
rate  constraint  p  are  listed  in  Table  2  at  channel  SNR 
=  OdB. 

The  same  observations  hold  as  what  were  observed 
in  the  Gaussian  observations  example:  as  p  increases, 
the  obtained  threshold  will  eventually  remain  un¬ 
changed.  The  results  obtained  using  the  proposed  al¬ 
gorithm  and  through  exhaustive  search  in  Table  2  for 
the  exponential  observation  case  match  very  well  for 
different  p’s. 

Under  exponential  observations,  similarly,  as  shown 
from  the  plot  of  the  KL  distance  versus  the  rate  con¬ 
straint  p  in  Fig.  3,  for  small  p,  increasing  p  improves 
D(P0\\Pi)  yet  as  p  becomes  large,  there’s  a  “satura¬ 
tion”  effect  for  £>(P0||Pi). 

5  Conclusion 

In  this  paper,  under  Neyman-Person  framework  we 
have  developed  a  procedure  to  obtain  the  optimal  local 
sensor  quantization  threshold  for  a  binary  hypothesis 
testing  problem  under  communication  constraint.  In 
particular  we  consider  the  asymptotic  case  where  the 
number  of  sensors  is  large.  By  adopting  energy  detec¬ 
tor  as  the  fusion  statistic  and  applying  central  limit 


theorem,  we  pose  the  design  problem  as  maximizing 
the  Kullback-Leibler  distance  of  the  distributions  of 
the  fusion  statistic  under  both  hypothesis  while  sub¬ 
ject  to  certain  rate  constraint.  Efficient  algorithms  are 
developed  to  numerically  obtain  the  optimal  thresh¬ 
old  at  local  sensors  for  different  scenarios  of  the  rate 
function. 
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Figure  2:  Kullback-Leibler  distance  as  a  function  of 
p  at  different  channel  SNRs  under  Gaussian  observa¬ 
tions. 
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Figure  3:  Kullback-Leibler  distance  as  a  function  of  p 
at  different  channel  SNRs  under  exponential  observa¬ 
tions. 


